Jiaming Ji

SPADE-Bench: Evaluating Spontaneous Strategic Deception in Agents via Plan-Action Divergence

Jun 01, 2026
Via arXiv

SafeMCP: Proactive Power Regulation for LLM Agent Defense via Environment-Grounded Look-Ahead Reasoning

Jun 01, 2026
Via arXiv

MiraBench: Evaluating Action-Conditioned Reliability in Robotic World Models

May 28, 2026
Via arXiv

RedVLA: Physical Red Teaming for Vision-Language-Action Models

Apr 24, 2026
Via arXiv

ShuttleEnv: An Interactive Data-Driven RL Environment for Badminton Strategy Modeling

Mar 18, 2026
Via arXiv

VISA: Value Injection via Shielded Adaptation for Personalized LLM Alignment

Mar 05, 2026
Via arXiv

What, Whether and How? Unveiling Process Reward Models for Thinking with Images Reasoning

Feb 09, 2026
Via arXiv

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Jan 26, 2026
Via arXiv

Enhance the Safety in Reinforcement Learning by ADRC Lagrangian Methods

Jan 26, 2026
Via arXiv

VLA-Arena: An Open-Source Framework for Benchmarking Vision-Language-Action Models

Dec 27, 2025
Via arXiv